643 research outputs found
Systematic discovery of structural elements governing stability of mammalian messenger RNAs.
Decoding post-transcriptional regulatory programs in RNA is a critical step towards the larger goal of developing predictive dynamical models of cellular behaviour. Despite recent efforts, the vast landscape of RNA regulatory elements remains largely uncharacterized. A long-standing obstacle is the contribution of local RNA secondary structure to the definition of interaction partners in a variety of regulatory contexts, including--but not limited to--transcript stability, alternative splicing and localization. There are many documented instances where the presence of a structural regulatory element dictates alternative splicing patterns (for example, human cardiac troponin T) or affects other aspects of RNA biology. Thus, a full characterization of post-transcriptional regulatory programs requires capturing information provided by both local secondary structures and the underlying sequence. Here we present a computational framework based on context-free grammars and mutual information that systematically explores the immense space of small structural elements and reveals motifs that are significantly informative of genome-wide measurements of RNA behaviour. By applying this framework to genome-wide human mRNA stability data, we reveal eight highly significant elements with substantial structural information, for the strongest of which we show a major role in global mRNA regulation. Through biochemistry, mass spectrometry and in vivo binding studies, we identified human HNRPA2B1 (heterogeneous nuclear ribonucleoprotein A2/B1, also known as HNRNPA2B1) as the key regulator that binds this element and stabilizes a large number of its target genes. We created a global post-transcriptional regulatory map based on the identity of the discovered linear and structural cis-regulatory elements, their regulatory interactions and their target pathways. This approach could also be used to reveal the structural elements that modulate other aspects of RNA behaviour
A Comprehensive Genetic Characterization of Bacterial Motility
We have developed a powerful experimental framework that combines competitive selection and microarray-based genetic footprinting to comprehensively reveal the genetic basis of bacterial behaviors. Application of this method to Escherichia coli motility identifies 95% of the known flagellar and chemotaxis genes, and reveals three dozen novel loci that, to varying degrees and through diverse mechanisms, affect motility. To probe the network context in which these genes function, we developed a method that uncovers genome-wide epistatic interactions through comprehensive analyses of double-mutant phenotypes. This allows us to place the novel genes within the context of signaling and regulatory networks, including the Rcs phosphorelay pathway and the cyclic di-GMP second-messenger system. This unifying framework enables sensitive and comprehensive genetic characterization of complex behaviors across the microbial biosphere
Genetic Architecture of Intrinsic Antibiotic Susceptibility
BACKGROUND:Antibiotic exposure rapidly selects for more resistant bacterial strains, and both a drug's chemical structure and a bacterium's cellular network affect the types of mutations acquired. METHODOLOGY/PRINCIPAL FINDINGS:To better characterize the genetic determinants of antibiotic susceptibility, we exposed a transposon-mutagenized library of Escherichia coli to each of 17 antibiotics that encompass a wide range of drug classes and mechanisms of action. Propagating the library for multiple generations with drug concentrations that moderately inhibited the growth of the isogenic parental strain caused the abundance of strains with even minor fitness advantages or disadvantages to change measurably and reproducibly. Using a microarray-based genetic footprinting strategy, we then determined the quantitative contribution of each gene to E. coli's intrinsic antibiotic susceptibility. We found both loci whose removal increased general antibiotic tolerance as well as pathways whose down-regulation increased tolerance to specific drugs and drug classes. The beneficial mutations identified span multiple pathways, and we identified pairs of mutations that individually provide only minor decreases in antibiotic susceptibility but that combine to provide higher tolerance. CONCLUSIONS/SIGNIFICANCE:Our results illustrate that a wide-range of mutations can modulate the activity of many cellular resistance processes and demonstrate that E. coli has a large mutational target size for increasing antibiotic tolerance. Furthermore, the work suggests that clinical levels of antibiotic resistance might develop through the sequential accumulation of chromosomal mutations of small individual effect
Accurate proteome-wide protein quantification from high-resolution 15N mass spectra
In quantitative mass spectrometry-based proteomics, the metabolic incorporation of a single source of 15N-labeled nitrogen has many advantages over using stable isotope-labeled amino acids. However, the lack of a robust computational framework for analyzing the resulting spectra has impeded wide use of this approach. We have addressed this challenge by introducing a new computational methodology for analyzing 15N spectra in which quantification is integrated with identification. Application of this method to an Escherichia coli growth transition reveals significant improvement in quantification accuracy over previous methods
The Iterative Signature Algorithm for the analysis of large scale gene expression data
We present a new approach for the analysis of genome-wide expression data.
Our method is designed to overcome the limitations of traditional techniques,
when applied to large-scale data. Rather than alloting each gene to a single
cluster, we assign both genes and conditions to context-dependent and
potentially overlapping transcription modules. We provide a rigorous definition
of a transcription module as the object to be retrieved from the expression
data. An efficient algorithm, that searches for the modules encoded in the data
by iteratively refining sets of genes and conditions until they match this
definition, is established. Each iteration involves a linear map, induced by
the normalized expression matrix, followed by the application of a threshold
function. We argue that our method is in fact a generalization of Singular
Value Decomposition, which corresponds to the special case where no threshold
is applied. We show analytically that for noisy expression data our approach
leads to better classification due to the implementation of the threshold. This
result is confirmed by numerical analyses based on in-silico expression data.
We discuss briefly results obtained by applying our algorithm to expression
data from the yeast S. cerevisiae.Comment: Latex, 36 pages, 8 figure
Global Networks of Trade and Bits
Considerable efforts have been made in recent years to produce detailed
topologies of the Internet. Although Internet topology data have been brought
to the attention of a wide and somewhat diverse audience of scholars, so far
they have been overlooked by economists. In this paper, we suggest that such
data could be effectively treated as a proxy to characterize the size of the
"digital economy" at country level and outsourcing: thus, we analyse the
topological structure of the network of trade in digital services (trade in
bits) and compare it with that of the more traditional flow of manufactured
goods across countries. To perform meaningful comparisons across networks with
different characteristics, we define a stochastic benchmark for the number of
connections among each country-pair, based on hypergeometric distribution.
Original data are thus filtered by means of different thresholds, so that we
only focus on the strongest links, i.e., statistically significant links. We
find that trade in bits displays a sparser and less hierarchical network
structure, which is more similar to trade in high-skill manufactured goods than
total trade. Lastly, distance plays a more prominent role in shaping the
network of international trade in physical goods than trade in digital
services.Comment: 25 pages, 6 figure
Cancer cells exploit an orphan RNA to drive metastatic progression.
Here we performed a systematic search to identify breast-cancer-specific small noncoding RNAs, which we have collectively termed orphan noncoding RNAs (oncRNAs). We subsequently discovered that one of these oncRNAs, which originates from the 3' end of TERC, acts as a regulator of gene expression and is a robust promoter of breast cancer metastasis. This oncRNA, which we have named T3p, exerts its prometastatic effects by acting as an inhibitor of RISC complex activity and increasing the expression of the prometastatic genes NUPR1 and PANX2. Furthermore, we have shown that oncRNAs are present in cancer-cell-derived extracellular vesicles, raising the possibility that these circulating oncRNAs may also have a role in non-cell autonomous disease pathogenesis. Additionally, these circulating oncRNAs present a novel avenue for cancer fingerprinting using liquid biopsies
In silico Analysis of Combinatorial microRNA Activity Reveals Target Genes and Pathways Associated with Breast Cancer Metastasis
This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited
Robust Detection of Hierarchical Communities from Escherichia coli Gene Expression Data
Determining the functional structure of biological networks is a central goal
of systems biology. One approach is to analyze gene expression data to infer a
network of gene interactions on the basis of their correlated responses to
environmental and genetic perturbations. The inferred network can then be
analyzed to identify functional communities. However, commonly used algorithms
can yield unreliable results due to experimental noise, algorithmic
stochasticity, and the influence of arbitrarily chosen parameter values.
Furthermore, the results obtained typically provide only a simplistic view of
the network partitioned into disjoint communities and provide no information of
the relationship between communities. Here, we present methods to robustly
detect coregulated and functionally enriched gene communities and demonstrate
their application and validity for Escherichia coli gene expression data.
Applying a recently developed community detection algorithm to the network of
interactions identified with the context likelihood of relatedness (CLR)
method, we show that a hierarchy of network communities can be identified.
These communities significantly enrich for gene ontology (GO) terms, consistent
with them representing biologically meaningful groups. Further, analysis of the
most significantly enriched communities identified several candidate new
regulatory interactions. The robustness of our methods is demonstrated by
showing that a core set of functional communities is reliably found when
artificial noise, modeling experimental noise, is added to the data. We find
that noise mainly acts conservatively, increasing the relatedness required for
a network link to be reliably assigned and decreasing the size of the core
communities, rather than causing association of genes into new communities.Comment: Due to appear in PLoS Computational Biology. Supplementary Figure S1
was not uploaded but is available by contacting the author. 27 pages, 5
figures, 15 supplementary file
MicroRNAs targeting oncogenes are down-regulated in pancreatic malignant transformation from benign tumors
BACKGROUND
MicroRNA (miRNA) expression profiles have been described in pancreatic ductal adenocarcinoma (PDAC), but these have not been compared with pre-malignant pancreatic tumors. We wished to compare the miRNA expression signatures in pancreatic benign cystic tumors (BCT) of low and high malignant potential with PDAC, in order to identify miRNAs deregulated during PDAC development. The mechanistic consequences of miRNA dysregulation were further evaluated.
METHODS
Tissue samples were obtained at a tertiary pancreatic unit from individuals with BCT and PDAC. MiRNA profiling was performed using a custom microarray and results were validated using RT-qPCR prior to evaluation of miRNA targets.
RESULTS
Widespread miRNA down-regulation was observed in PDAC compared to low malignant potential BCT. We show that amongst those miRNAs down-regulated, miR-16, miR-126 and let-7d regulate known PDAC oncogenes (targeting BCL2, CRK and KRAS respectively). Notably, miR-126 also directly targets the KRAS transcript at a "seedless" binding site within its 3'UTR. In clinical specimens, miR-126 was strongly down-regulated in PDAC tissues, with an associated elevation in KRAS and CRK proteins. Furthermore, miR-21, a known oncogenic miRNA in pancreatic and other cancers, was not elevated in PDAC compared to serous microcystic adenoma (SMCA), but in both groups it was up-regulated compared to normal pancreas, implicating early up-regulation during malignant change.
CONCLUSIONS
Expression profiling revealed 21 miRNAs down-regulated in PDAC compared to SMCA, the most benign lesion that rarely progresses to invasive carcinoma. It appears that miR-21 up-regulation is an early event in the transformation from normal pancreatic tissue. MiRNA expression has the potential to distinguish PDAC from normal pancreas and BCT. Mechanistically the down-regulation of miR-16, miR-126 and let-7d promotes PDAC transformation by post-transcriptional up-regulation of crucial PDAC oncogenes. We show that miR-126 is able to directly target KRAS; re-expression has the potential as a therapeutic strategy against PDAC and other KRAS-driven cancers
- β¦